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Component structure induced by a random walk on a 

random graph. 

Colin Cooper* Alan Frieze^ 

March 23, 2011 



Abstract 

We consider random walks on two classes of random graphs and explore the likely 
structure of the vacant set viz. the set of unvisited vertices. Let T(t) be the subgraph 
induced by the vacant set. We show that for random graphs Gn,p (above the connectivity 
threshold) and for random regular graphs Gr, r > 3 there is a phase transition in the 
sense of the well-known Erdos-Renyi phase transition. Thus for t < (1 — e)t* we have 
a unique giant plus logarithmic size components and for t > (1 + e)t* we only have 
logarithmic sized components. 



1 Introduction 



The problem we consider can be described as follows. We have a finite graph G = (V, E), and 
a simple random walk Wu on G, starting at u ^ V. What is the likely component structure 
induced by the unvisited vertices of G at step t of the walk? 

Initially all vertices V oi G are unvisited or vacant. We regard unvisited vertices as colored 
red. When Wu visits a vertex, the vertex is re-colored blue. Let Wu{t) denote the position of 
Wu at step t. Let B(t) = {PV„(0), Wu{l), ■ ■ ■ , Wuit)} be the set of blue vertices at the end of 
step t, and TZuit) = V\Bu(t). Let T(t) be the subgraph of G induced by 7l{t). Initially r(0) 
is connected, unless m is a cut- vertex. As the walk continues, r(t) will shrink to the empty 
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graph once every vertex has been visited. We wish to determine, as far as possible, the hkely 
evolution of the component structure as t increases. In this paper we will consider two models 
of random graph: The random graph Gn,p and the random r-regular graph Gr,r > 3, both 
on vertex set V = [n]. 

Re-call the typical evolution of the random graph Gn,p as p increases from to 1. Initially 
it consists of isolated vertices. As we increase p or equivalently add random edges, we find 
that the maximum component size increases from logarithmic size for p = c/n, c < 1 and then 
after the phase transition for c > 1 there is jump in maximum component size to linear in n 
and all other components have logarithmic size. Our aim in this paper is to show that whp 
r(t) undergoes a reversal of this. In both cases ^(0) is connected and it will start to break up. 
There will be a critical value t* such that if t < t* by a sufficient amount then Q(t) consists 
of a unique giant component plus a collection of components of size O(logn). Once we have 
passed through the critical value, i.e. t > t* by a sufficient amount then all components are of 
size O(logn). The maximum component size will then shrink to zero. We now provide some 
more details. We say that a graph is sub- critical if its maximum component size is 0(log?T,) 
and super-critical if is has a unique component of size Q{n) and all other components are of 
size O(logn). 



We first consier Gn,p- Here we assume that 

p = — where (c — 1) logn -^ oo with n and c = n°'^^K 



Clog'^ .,____._ .^,_____ . __ ..., , _ _„(1) 



n 
Let 

tg = n(loglog?i + {1 + 9) logc). 

Theorem 1. Let e > be a small constant. Then whp we have (i) T{t) is super-critical for 
t < t_e and (a) r(t) is sub-critical for t > t^. 

The bound c = n°^^^ is not optimal. On the other hand there is not going to be a phase 
transition if p is constant. 

Now let us consider Gr for r > 3, constant. 

Let 

r(r — 1) logfr — 1) , , 

t* = -^ T^^, -n. 1 

(r-2)2 ^ ^ 

Theorem 2. Let e > be a small constant. Then whp we have (i) T{t) is super- critical 
for t < (1 — e)t* , (a) r(t) is sub-critical for t > (1 + e)t* and (Hi) at some time t ^ t* the 
maximum degree in V{t) is n^/^+°'^'^\ 

We can also say something about the degree sequence of r(t). Let 

r — 1 __L 

p = and Nt = ne p" . 

r — 2 



Let 

1-1 /fc 1 . prnlogn 

Tfc = n^ ^1" and tk = -rr r • (2) 

k{r-2) + 2 ^ ^ 

Let cj/loglogn — )■ oo and u = o{logn)} 
Theorem 3. Whp 

(a) |7^(t)| = (l + o(l))iV,. 

(b) j;)^^t^^h^ + oa))NtOPta-PtY-' r,_.«t<W-u;n 

1 t <^ Tr-s or t > ts+1 + ton 

0<s<r. 

where Ds(t) is the number of vertices of degree s in r(t) and 

(r-2)t 

The range tg — ojn < t <ts + un contains the times when the number of vertices of degree s 
is constant in expectation and unhkely to be concentrated around its mean. We do not think 
that the asymptotic distribution is Poisson at this point. We can give some more information 
about the number of small components in T{t). Again there is a gap containing the times 
when the expected number of such components is constant. 

Theorem 4. Let e be a small positive constant and let 1 < k < elogn and t < (1 — e)tfc. Let 
N{k,t) denote the number of components ofr{t) with k vertices. Then whp 

^(^^,) ^ ^ .^ ■ «(l))MF^tT2) (tf )pr'(l -P*)'^^^-^)-^^ rHr-2H2 « t < (1 - e)t, 

t < Tk(r-2)+2 ort> (1 + e)tk 

Note that we do not claim that we can prove that the statements in the above hold for all 
t simultaneously, although we suspect they do. Our proofs only show this to be true for 
most values of t. 



1.1 Previous work 

The only previous works on this subject that we are aware of are Benjamini and Sznitman [1], 
Windisch [18] and Cerny, Teixeira and Windisch [5]. Papers [1], [18] deal with the component 
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structure of the vacant set for a random walk on a (i-dimensional torus. The latter deals with 
random walls on Gr- They show that whp r(t) is sub-critical for t > (1 + e)t* and that there 
is a unique linear size component for t < (1 — e)t*. They conjecture that r(t) is super-critical 
for t < [1 — e)t* and we prove this conjecture. On the other hand their proof holds for all 
graphs in some class of regular graphs containing almost all regular graphs. 



2 Uniformity 

The main idea is to reahse that the graph r(t) has a simple distribution. First consider Gn,p- 

Lemma 5. 

Consider a random walk on Gn.p- Conditional on N = \TZ{t)\, r(t) is distributed as G^^p. 

Proof This follows easily from the principle of deferred decisions. We do not have to 
expose the existence or absence of edges between vertices in TZ{t) until one of them is exposed. 

D 

Thus to prove Theorem 1 we only need high probability estimates of |7?.(t)|. 

Now consider G^- We give two structural definitions. 

Lemma 6. 

Consider a random walk on Gr- Conditional on N = |7^(t)| and its degree sequence d = 
dr(t){v),v G TZit), T{t) is distributed as G]\[d, the random graph with vertex set [n] and degree 
sequence d. 

Proof Suppose that we condition on TZ{t) and the history V. = (144(0), 1V„(1), . . . , Wu{t)). 
If Gi,G2 are graphs with vertex set TZ{t) and if they have the same degree sequence then 
substituting G2 for Gi will not conflict with T-L i.e. every extension of Gi is an extension of 
G2 and vice-versa. □ 

Thus to prove Theorem 2 we only need high probability estimates of the degree sequence of 
r(t). The proof of Theorem 4 can in principle be derived from this, although we do not have 
a simple way of doing it. Instead we rely on a further characterisation of r(t). 

We use the configuration or pairing model of BoUobas [3] and Bender and Canfield [4]. We 
start with n disjoint sets of points Wi, W2, ■ ■ ■ , Wn each of size r. We let W = Ur=i ^«- 
A configuration F is a partition of W into rn/2 pairs i.e. a pairing. Q is the set of 
configurations. If F G f2 defines an r-regular multi-graph Gp = {[n\,EF) where Ep = 
{{i,j) : 3 {x, y} & F : X & Wi, y e Wj} i.e. we contract W^ to a vertex i for i G [n]. 



It is known that (i) each (simple) graph arises the same number of times as Gp and (ii) the 
probabihty that Gp is simple is bounded below by a constant, if r is constant. Thus if F is 
chosen uniformly at random from Vt then any event that occurs whp for Gp will occur whp 
for Gr- 

Suppose now that we generate a random F as we do a random walk on [n\. We begin with a 
starting value ii and at the start of tth step we are at some it and we have a partition Rt, Bt 
of W into Red and Blue points respectively. Initially, Ri = W and Bi = 0. In addition we 
have a collection Ft of disjoint pairs from W. Initially Fi = 0. 

We choose x randomly from Wi^ - choose a random edge incident with it. Ii x E Rt we choose 
y randomly from Rt \ {x} and add then let Ft+i = FtU {{x, y}}. Suppose that y G Wj. This 
is equivalent to moving from it G B{t) to it+i = j. We remove {x, y} from R and move them 
to B. 

If on the other hand, x E B then it has previously been paired with a y G Wj fl B and we 
move from it to it+i = j and let Rt+i = Rt, -Bt+i = Bt and we let Ft+i = Ft. 

After t steps we will have constructed a random collection Ft oi < t disjoint pairs from W 
and in principle we can extend it to a random configuration F by adding a random pairing of 
Rt to it. We summarise what we need in a lemma. 

Lemma 7. 

(a) Ft plus a random pairing of Rt is a random member ofQ. 

(b) ^G7^(t) zffW.CRt. 



n 



3 Vacancy probabilities 

As in our previous papers on random walks on random graphs, we make heavy use of Lemma 
8 below. Let P be the transition matrix of the walk and let Pu (v) = Pr{Wu{t) = v) be the 
t-step transition probability. We assume the random walk Wu on G is ergodic, and thus the 
random walk has stationary distribution vr, where vr^, = d{v)/{2m). 

Let A2 be the second eigenvalue of P, and let $£■ be the conductance of G i.e. 

$.= min ^^.s-^P(-^S)^ 



SCV,7rs<l/2 TVs 



Then, 

1 - $G < A2 < 1 - ^ (3) 

\Pl^\x)-n^\<{7rJ^^Yl'\\. (4) 

A proof of this can be found for example in Jerrum and Sinclair [14]. In addition, Friedman 
[11] has shown that whp A2 < (2A/r — 1 + e)/r < 29/30, say. (For (4) we need A2 = Amax 
which can be achieved by making the chain lazy i.e. by not moving with probability 1/2 at 
each step. This has no significant effect on the analysis). 



Let T be such that, for t > T 



In which case we can whp take 



max|pW(x)-vr.| = Op^^^^^ . (5) 



T<1201ogn. (6) 

If inequality (5) holds, we say the distribution of the walk is in near stationarity. Fix two 
vertices u^v. Let ht = Pr(Wu(t) = v) be the probability that the walk W^ visits v at step t. 
Let 



00 



H{z) = J2h,z' (7) 



t=T 



generate ht for t >T. 



We next consider the returns to vertex v made by a walk W^, starting at v. Let rt = 
Pr(Wv(t) = v) be the probability that the walk returns to v at step t = 0, 1, .... In particular 
note that rg = 1, as the walk starts on v. Let 

00 

t=0 

generate rt, and let 

T-l 

RT{z) = Y,r,z^. (8) 



i=o 
Thus, evaluating Rt{z) at 2; = 1, we have RriX) > ^o = 1- 

For t > T let ft = ft{u^v) be the probability that the first visit made to v by the walk Wu 
to V in the period [T, T + 1, . . .] occurs at step t. Let 

F{z) = J2ftz' 

t=T 

6 



generate ft- Then we have 

Hiz) = F{z)Riz). (9) 

The following lemma gives the probability that a walk, starting from near stationarity makes 
a first visit to vertex v at a given step. For proofs of the lemma and its corollary, see [6], [8]. 

Lemma 8. Let R^ = Rt{1), where Rt{z) is from (8). For some sufficiently large constant 
K, let 

where T satisfies (5). Suppose that 



(i) For some constant 9 > 0, we have 



min \Rt(z)\ > 9. 

\z\<l+\ 



(ii) Tvr^ = o(l) and Ttt^ = Vt{n 
There exists 

such that for all t >T, 



^" i?.(l + 0(Tvr,))' ^^^^ 



f,{u^v) = (1 + 0(Tvr.)) /; + 0(Tvr„e-^*/^). (12) 

{^ ~r Pv) 

= (l + 0(T7r,)) ^^ for t> log' n. (13) 

[i -r Pv) 

Corollary 9. For t > T let Av{t) be the event that Wu does not visit v at steps T,T + 1, . . . ,t. 
Then, under the assumptions of Lemma 8, 



[l+P 

;i + o(r7r,)) 



Prw(A(t)) = ' ;^ 'yr + OiT^nvC-'^/') (14) 



.^+Pv 



for t > log n. (15) 



VKe ttse i/ie notation Prw w/ien we wani to emphasise that we are dealing with the probability 
space of walks on G. 



4 The evolution of r(t) in G, 



4.1 Estimates of R^ 

Let 

ii = ei log^ n 

for some sufficiently small ei. A cycle C is small ii \C\ < ii. A vertex is nice if it is at distance 
at least ii from any small cycle. Let A/" denote the nice vertices and A/" denote the vertices 
that are not nice. 

It is straightforward to prove by ffist moment calculations that: 

Whp there are at most n^'^^ vertices that are not nice. (16) 

Whp there are no two small cycles within distance ii of each other. (17) 

A vertex v is tree-like to depth k if A^fc(t') induces a tree, rooted at v. Here Nk{v) denotes the 
set of vertices at distance at most k from v, k > 1. Thus a nice vertex is tree-like to depth 

ii/2. 



Lemma 10. 

(a) // V is nice then 



(b) Ifv is not nice then 



r 
i?^ = (1 + o(l))p where p = — 



Rv< (l+o(l)) — 



Proof (a) Let H denote the subgraph of G induced by Ni^i2{v). This is a tree and we 

can embed it into an infinite r-regular tree T rooted at v. Let X be the walk on T, starting 
from V, and let Xt be the distance of X from the root vertex at step t. 

Let Wq = 0, and let Wt be the distance from w of W in G at step t. We note that we can 
couple >V„, X so that Wt = Xt up until the ffist time that Wt > £i/2.. 

The values of Xt are as follows: Xq = 0, Xi = 1, and if Xj = then Xt+i = 1. If Xt > then 

Xj_i - 1 with probability q = ^ ,^g, 

Xt-i + 1 with probability p = ^-^. 



We note the following result (see e.g. [10]), for a random walk on the line = {0, ...,a} with 
absorbing states {0,a}, and transition probabilities q,p for moves left and right respectively. 
Starting at vertex z, the probability of absorption at the origin is 

8 



provided q < p. 

Let Uoo = {3t > 1 : Xt = 0}, i.e. the event that the particle ever returns to the root vertex in 
T. It follows from (19) with z = 1 and a = oo that 

Pr{U^) = -^. (20) 

r — 1 

It follows that the expected number of visits by A' to f is ^i = p. We write 

r-l 

T OO 

Rv = '^rt and p = '^pt 

t=0 t=0 

where pt = Pr{Xt = v). 

Now rt = Pt for t < ii/2 and part (a) follows from 

T oo 

Y, n = 0(1) and Y. P* = o(l)- (21) 

t=£i/2+l t=£i/2+l 



The first equation of (21) follows from 

1 
< A 



1 

n — 



max 



n 
where Amax is the second largest eigenvalue of the walk. This follows from (4). 

The second equation of (21) is proved in Lemma 7 of [6] where it is shown that 

t=£l/2+l 2j=ei/2+l ^-^ ^ 2j=£i/2+l ^ ^ 

(b) We next note a property of random walks on undirected graphs which follows from results 
on electrical networks (see e.g. Doyle and Snell [9]). Let f be a given vertex in a graph G and 
5" a set of vertices disjoint from v. Let p{G), the escape probability, be the probability that, 
starting at v, the walk reaches S before returning to v. For an unbiased random walk, 

1 

P 



d{v)REFF' 

where Reff is the effective resistance between v and S in G. We assume each edge of G has 
resistance 1. In the notation of this paradigm, deleting an edge corresponds to increasing the 
resistance of that edge to infinity. Thus by Raleigh's Monotonicity Law, if edges are deleted 
from G to form a sub-graph G' then R'eff — Reff- So, if we do not delete any edges incident 
with V then p' < p. 

9 



It follows from (17) that H becomes a tree after removing one edge. We can remove an edge 
not incident with v. By the above discussion on electrical resistance we see that this will not 
decrease Pr([/^), where this is Uoo defined with respect to T* which is T less one edge, not 
incident with v. We can argue crudely that 

P„t;-)<i + !:^ 1 .?. 

This is because there is an ^^ chance of a first move to a part of the tree that has branching 
factor r — 1 at every vertex. 

Let Ul = {X returns to the root vertex after starting at ii/2}. Then, with fx equal to the 
probability of a return by Wt, to v during [1,T], we have 

/T<Pr(f/^)+TPr(f/;). (23) 

The RHS of (23) is at least the probability that W„ returns before reaching distance ii/2 or 
returns after reaching distance ii/2 at some time t < T. 

Now, using (19), we see that 

Here we have £i/4 in place of ii/2 to account for the one place where we move left with 
probability ^:^. We argue that at least one of the paths from i; to w or w to the boundary 
must be at least £i/4 and not use the vertex incident to the deleted edge. 

Thus /t < (2 + o(l))/r and since R^ < j^ we have i?„ < ^^. D 



4.2 Proof of Theorem 3 

We will asssume initially assume that t > log n and deal with the very beginning of the walk 
in Section 4.5. 

Corollary 9 gives the probability of not visiting a single vertex in time [T, t] . We need to extend 
this to certain small sets of vertices. In particular we only need to consider sets consisting of 
V and a subset of its neighbours N{v). Let S be such a subset. 

Suppose now that 5 is a subset of V with 15*1 = o{n). By contracting S to single vertex 
7 = 7(5), we form a graph H = H{S) in which the set S is replaced by 7 and the edges that 
were contained in S are contracted to loops. The probability of no visit to 5 in G can be 
found (up to a multiplicative error of 1 + 0{l/n^)) from the probability of a first visit to 7 in 
H. This is the content of Lemma 11 below. 



10 



We first check the mixing time of a walk on H. Note first that the conductance of H is at 
least that of G. This is because the set of values that we minimise over for if is a subset of 
the set of values that we minimise over for G. It follows that the mixing time for W in iJ is 
also O(logn). 

Lemma 11. [8] Let W„ he a random walk in G starting at u ^ S, and let X^ he a random 
walk in H starting at u ^ '-f. Let T he a mixing time satisfying (5) in hoth G and H . Then 

Pr(^^(t); H) = Pr(A,e5A.(t); G) ( 1 + O ' ^ 



■TV"/'/ V ^t'tJ-^-i'Vy 1 ^-^ J \ ^ \ ^^ \ o 

where the prohahilities are those derived from the walk in the given graph. 

Proof 

Note that m = rn/2 = \E{G)\ = \E{H)\. Let W^{j) (resp. X^{j)) be the position of walk 
Wx (resp. A'xO)) at step j. Let T = G,H and let -P^(x; F) be the transition probability in F, 
for the walk to go from u to a; in s steps. 

Pr{A,{t)-H) = 5^Pj(x;/7)Pr(X,(s-T)^7, T<s<t;i7) 

= E (^(1 + Oin-'))] PriXxis -T)^^,T<s<t;H) (25) 

= J2iPui^;G){l + 0in-')))Pr{Wx{s-T)^S,T<s<t;G) (26) 

= Pr(A,e5A(t);G)(l + 0(l/n3)). 

Equation (25) follows from (5). Equation (26) follows because there is a natural measure 
preserving map between walks in G that start at x ^ S* and avoid S and walks in H that 
start at X 7^ 7 and avoid 7. □ 

Fix V eV and let N{y) = {wi,W2, ■ ■ ■ , Wr} and choose < s < r. We estimate 
Ils{v,t) = Pr>v(t' is a vertex of degree s in F(t)) 

1 Pry^{{v, wi,..., Ws} C TZ{t) and {ws+i, . . . , Wr} C B{t)) (27) 



Thus 

vev 

The next lemma gives us enough information to compute the expected degree sequence of nice 
vertices in 7l(t). 

11 



Lemma 12. WhpQ, for all nice vertices v, (we use the subscript G when we want to empha- 
sise that the probability space is random r-regular graphs). 

(a) Prw(f e n{t)) = (1 + o(l))e-^. 

(b) Pryv{{v, wi,..., Ws} C n{t) and {ws+i, ...,Wr}^ B{t)) = (1 + o(l))e-^K (1 - pty-\ 

Proof Part (a) follows directly from Lemmas 8 and 10. 
For part (b) we can write 

Prwd'W, wi, . . . , Ws} C 7l(t) and {wg+i, ■ ■ ■ , Wr} C B(t)) 

= Y, (-l)""Prw(({^,w^l,•••,w^JUX)C7^(t)) 

X<Z[s+l,r] 

XC[s+l,r] ^^+Plx) 

where |X| = 0, ...,r — s. If s = 0, we suppose that {v, wi, . . . , Wg} = {v}. The term •jx denotes 
the contraction of {v, Wi, . . . , Wg} U X, and 

^A\ , . . ^^ ((^-2)(s +1X1) + r)(r- 2) 

n^^=0{-] and p^^ = {1 + o{l)) ^^ A ^ I i; ^ A ; 



nj r{r — l)n 

Explanation: The expression for p^^ is the product of "-^^ — — -^ the total degree of 7x 
divided by the total degree and the inverse of the expected number of returns to 7x in the 
mixing time. Since v is nice, the expected number of returns (up to a factor 1 + o(l)) is ^37 
where / is the probability of return to the root in an infinite tree with branching factor r — 1 
at each non-root vertex. At the root there are s + |X| loops and (r — 2)(s + \X\) +r branching 
edges. This gives 

^ 2(^+1^1) A _ 2(.+ |X|) \ ^ ^^d 1 _ X ((r-2)(.+ |X|) + r)(r-2) 

(s+|X| + l)r V {s+\X\ + l)r) r -I ^ (s + |X| + l)r(r - 1) 

Thus, for s = 0, 1, ...,r, 

Prwd^', wi, . . . , Ws] C 7l{t) and {wg+i, ■ ■ ■ , Wr} C B(t)) = 

f . . ,Jr-2Ys + r(r-2) ] ^^ , ,1^, f , , ^Ar -2f\X\ \ 

exp -l + ol ^ \ -^ k\ J2 -I'^'exp -l + ol ^ \\ ' t 

I r(r-l)n J ^^t^^ , I r(r - l)n J 



XC[s+l,r] 

(29) 



When t = 0{n) we can write (29) as 



(r-2£i+r(r-2) 1 / f (r - 2)^ 



axp^-(l + o(l)) ' ;^^_^^; ^ t|^l-exp|--^— ^tM +o(l). 

12 



and (b) follows, since the terms above are Q{1). When t/n — > oo we go back to (29) and 
observe that the sum is 1 — o(l) and thus 

Pryv{{v, Wi, . . . , Ws} C 7l(t) and {ws+i, ■ ■ ■ , Wr} C B(t)) = 

r (r-2fs + r(r-2) 

1 + o 1 exp -^ L -^ U 

I r[r — Ijn 

as required. □ 

It follows from Lemmas 8 and 10 that 

whp A/" C B{t) for t > lOeinlogn. (30) 

Lemma 12 verifies that Nt and Ds(t) have expectations asymptotic to the claimed values in 
Theorem 3. The expressions in this lemma only apply to nice vertices. But (30) allows us to 
use them for t >t2- On the other hand, for t <t2 we have 

E(D,(t)) > nl/2 ^ ^2ei 

and the vertices of A/" are asymptotically negligible. We will therefore ignore them in compu- 
tations concerning Ds{t) from now on. 

In particular Lemma 12 and the Markov inequality already show that 

Ds(t) = whp for s > and t > t^+i + wn or t ^ r^-^. (31) 

Thus whp r(t) consists of isolated vertices only, from ti until the cover time. 

We now need to prove concentration for Nf and Ds{t). 

Let TZj^it) = TZit) n A/" and Zj^(t) = \7lj^(t)\. We will use the Chebyshev inequality. Suppose 
that t < ti- We will show that 

Var(Z^(t)) = 0(r"E(Z^(t))) + e~'^-E{Z^{t))y . (32) 

for some constant a > 0. 

Fix t < ti and let S^, be the event that vertex v G TZ{t). Let u be as claimed just before 
Theorem 3. We claim that ii v,w are at distance at least u then 

Pr{S, n S^) = (1 + e-^("))Pr(^,)Pr(£^). (33) 

We use Lemma 11. Let S = {v,w}. We argue that for the random walk on the associated H 

we have 

2 2 

R^ = p + e"^^'^) and vr^ = - and hence p^ = (1 + OiTr-"^)) — . (34) 

n pn 
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But the expression for p^ is clear and the expression for R^ can be demonstrated using the 
proof of Lemma 10. In this calculation one has to estimate the expected number of returns 
from V io V, w io w and visits from v to w and vice-versa during the mixing time. The latter 
contributes 0(Te~^''"-') = 0(e~^'^'^'') < e~"'^ because v and w are at distance at least u. 

Equation (33) follows from Lemmas 8, 11 and (34). Thus, 

E(Z_^(t)) = E(Z^(t)) + Y. Pr(4nfJ+ Y. Pr(4n^J 

V ,w.dist{v ,w)>uj v,w.dist{v,w)<u} 

< EiZ^m + (1 + e-"-)E(Z^(t))2 + r'^EiZ^it)) 
and (32) follows. 
Applying the Chebyshev inequality we see that 

Pr(|Z^(t) - E{Z^m > E(Z^(t))e— /3) < / + e^ /^ = o(l) 

for t < ta. 

This completes the proof of part (a) of Theorem 3. 

The proof of part (b) is similar to that of (a). Observe first that Lemma 12 implies 

E{D,{t)) = (1 + o{l))N, Qptil - p,r-\ (35) 

It follows from (35) that 

E{Dsit)) = fi(r'^/^--=) for t < t,+i - un 

We can use the Chebyshev inequality to prove concentration. We let J^^(s) be the event that 
f is a vertex of degree s in T(t). We prove that 

Pr{J^,{s) n J-^(s)) = (1 + e-^('^))Pr(J-,(s))Pr(J-^(s)) 

whenever v, w are at least u apart. We can argue for this by a small change in the argument 
for (a). This proves concentration for t < tg+i — oon and proves part (b). □ 

4.3 Proof of Theorem 2 

We combine Lemma 6 and Theorem 3 with the results of Molloy and Reed [16, 17]. We 
summarise what we need from the first two papers: 
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Theorem 13. Let Aq, Ai, . . . , A^ € [0, 1] be such that Aq + Ai + ■ ■ ■ + A^ = 1. Suppose that 
d = di,d2, ■ ■ . ^d^ satisfies \ {j : dj = s}\ = (1 + o(l))AsA^ for s = 0, 1, . . . , r. Let G„ d be 
chosen randomly from graphs with vertex set [N] and degree sequence d. Let 

r 

L = ^s{s -2)Xs. 

s=l 

(a) If L < then whp Gn,d is sub-critical. 

(b) If L > then whp Gn,d is super- critical. Furthermore the unique giant component has 

size On where 6 is defined as follows: Let A = ^^^isA^. Define a to be the smallest 
positive solution to 

s/2 



A-2"-E»^-(l-f)' =0 

s=l ^ ^ 



Then 



-5:m-I) 



s=l 



1/2 



n 

We now evaluate A in the context of r(t). Then Theorem 3 imphes that we can take 

^ = E Qk(1 -Pt)^"^5(5 - 2) = rp,{{r - l)p, - 1). 

Thus the critical value for t is the one that gives pt = ^. One can easily check that this is 
indeed the case for t* as defined in (1). parts (i) and (ii) of Theorem 1 follow immediately. 

To prove (iii) we use the result of Hatami and Molloy [13] that when \L\ = 0{n~^^^) the size 
of the giant is n'^/^~^°W _ \i each step, L changes by 0{l/n) and so at some time t r^ t* the 
conditions of [13] will be satisfied. At this point whp there are 0(n) vertices in r(t) and (iii) 
follows. □ 



4.4 Proof of Theorem 4 

In this section we study the number of components of a given size. In principle one should 
be able to work this out from Lemma 6 and Theorem 3. This has proven more difficult than 
we anticipated. Instead, we try to estimate the number directly. We can use these lemmas 
though to argue that almost all small components are trees. Indeed if we fix t and condition 
on the values Dg = Ds(t) satisfying Theorem 3 then we have the following: 

Lemma 14. 
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(a) If t < einlogn then whp there are at most n^'^^ components of size k < eilogn that are 

not trees. 

(b) Ift> einlogn then whp there are no components of size k < log^n that are not trees. 

Proof Let A^ = |7^(t)|. Applying Lemma 6 we see that the expected number of sets of k 
vertices that contain at least k edges is bounded by 



N'\ fQ^ fr\'' frNe 



2\ '^ 



k J \ k J \nJ \ 2n 

To prove (a) we take N = n and apply the Markov inequality. To prove (b) we take N = Nt. 

a 

With this in mind we concentrate on the number of tree components of size k for some 
k < ei log n. Since there are whp at most n"^^^ vertices that are not nice, we will concentrate 
on counting the number of components that are made up of nice vertices only. We will also 
assume that t <ti, see (31). 

The following is proved in [2] - Lemma 4. 

Lemma 15. Let bk be the number of subtrees of size k rooted at a vertex v in an infinite 
r -regular tree T . Then 

r f{r — l)k^ 



D 



(r-2)A; + 2 V k-1 



Now consider the situation described in Lemma 7. Fix v G TZ{t) and consider the k neigh- 
bourhood of V in the multi-graph on [n] induced by a random pairing on R{t). It is whp a 
tree. Now delete any edge that corresponds to an edge {x,y) with x G lZ{t),y G B{t). Let T 
be the component that contains v. If T has k vertices then T corresponds to a tree component 
of r(t) with k vertices. 

So, fix a tree T* as counted in Lemma 15 and let us determine the probability that T = T*. 
The total degree of B{t) is (1 + o(l))rA'j and it follows from Theorem 3 that that the number 
of TZ(t) : B(t) edges is (1 -|- o(l))7Vtr(l — pt). So if we choose x G W-jK^t) then it is paired with 
something in Wnit) with probability (1 + o{l))pt, where the o(l) term is o(l/logn). 

It follows that 

Pr(T = T*) = (1 + o{l))pt\l - Ptf"^^"^^. (36) 

To see this, start at the root v and examine the points paired with W^. Each point will have 
probability (1 + o{l))pt of being paired with an element of Wr,(j) and the o(l) term will not 
change by more than 0(log?7,/A^j) = o{n~^^^) as we proceed with this argument. The count 
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in Lemma 15 assumes an ordering of the neighbours of each vertex and by imphcation an 
ordering of W^ and a statement about which members of W^ are paired with 1^7^(4) and which 
should not. Suppose we pair W^ with points from Wx^,i = 1,2, ... ,d. Then we continue by 
pairing up Wx^ and then Wx2 and so on. The factor p^~^ is from the k — 1 times we have to 
pair with WT^^t) and the factor (1 — j)^)(''~2)fc+2 jg fj^Qj^ i^j^g number of times we do not. Note 
that the sizes of unpaired W-jK^t) and WR^i3[t){t) change by 0(log?7,) as we proceed. 

It follows from (36) that 

EiNik,t)) = (l + oil))N,^pt'a -Vtf'-'^'^'. 

It remains to prove concentration around the mean. We use the Chebyshev inequality. We 
fix two vertex disjoint trees Ti,T2 in G. Arguing as above we see that 

2 
Pr(Ti,T2 are components of r(t)) < (1 + o(l)) ] [ Pr(Tj is a component of r(t)). 

We get what we want, provided E(iV(fc,t)) — )■ cxd and it does so for t < (1 — e)tfc. 



4.5 In the beginning 

Using Lemma 6 we see that for 1 < t < log n we have that r(t) is a random graph with a 
degree sequence of the following form: There are n — s vertices of degree r, where s < rt, and 
s vertices of degree < r. If the minimum degree in r(t) is at least one then whp we find that 
r(t) is connected. Indeed, let Vr be the set of vertices of degree r in r(t). We argue that whp 

Vr induces a connected subgraph of T{t). (37) 

Each X G TZit) \ V^ is adjacent to Vr. (38) 

For k even let 

m 



{k/2)\2^/^ 
be the number of ways of partitioning [k] into k/2 pairs. 
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Let m = O(logn) be the sum of the degrees, in r(t), of the vertices in TZ{t) \ K-. Then 
PrmfaUs) < y f f "1 h) <^ikr + imn-s-k)r + m-l) 



A;=3 Z=0 

n/2 m / \ / \ ({r(n-s)+m)/2\ 

n\ m\ \ {kr+i)/2 ) 



k \l (r{n-s)+vri\ 

n\ fm\ 1 



fc=3 1=0 

n/2 m 

SEE 

fc=3 /=0 ^ ^ ^ ^ V {fcr+0/2 

"■/^ ™ / \ / \ / ; , ; \ (kr+l)/2 

\r-^\-^ l'n\ im\ [ kr + l ^' 

fc=3 /=0 
= 0(1). 



((r(n-s)+m)/2\ 
\ (kr+l)/2 ) 



k I \ I I \ rin — s) + ra 



Explanation of (39): Choose a set of k vertices S of degree r and / points from the m points 
in W associated with vertices T of degree less than r. (Some of the points associated with 
these latter vertices have already been paired). Now pair up the kr + / points randomly and 
the remaining points randomly. The / points contain the edges between S and T . 

The probability that (38) fails is 0(logn/n). A vertex of Tlit) \ Vr of degree d has an 0{n"'^) 
chance of not being connected to K-- 

So we only have to deal with the possibility that there are isolated vertices in r(t) for t < log n. 
So consider the event 

A{t) = {3v G N{Wt) : V e 7^(t) and N{v) C B{t)} . 

We claim that 

Pr(^(^)) = o ((^r^^) • (40) 

It follows that 

log n 

Pr((38) fails) < J^ Pr(^(t)) = o(l). 

i=l 

To prove (40) fix t and a neighbour v of Wt- Equation (17) implies that there is at least one 
neigbour w of f that is not contained in a small cycle, li w ^ Xt then to reach w the walk 
W must emulate a walk on the infinite tree T that starts at distance £i/2 from the root and 
visits it. This has probability l/(r — 1)^^'^ and this must be inflated by log n to account for 
log n possible starting times, li w = Xt then to visit another neighbour of v then we must 
first reach distance at least ii/2 and then we can repeat the argument and use inequality (24). 
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5 The evolution of r(t) in G 



Because T{t) has the distribution Gjy^p, we only need to get good estimates of A^ and we can 
get these from Lemma 8. Assume that c = n^^'^ where w — t- oo. It is shown in [8] that whp 
R^ = 1 + 0(l/logn) for all v E V. Let coi = log"^' ^ n. For a fixed vertex v we have that its 
degree dciv) satsifies 

Prddciv) - clogn\ > uiiclognY^^) < 2e-'^i/^ 

This follows from Chernoff bounds on the binomial. 

So by the Markov inequality, we see that whp all but 0{ne~^^^^) vertices have degrees in 
the range clogra ± a;i(clogra)^'^. Denote these vertices by A/^. Fix such a vertex, then (15) 
implies that 

Pr{v e n(t)) = (1 + 0(logn/n))e-('+^('/"^))*/" = g-^i+^^i/^i))*/'^ 

if we assume that t ~ n log log n. So if t = to where 9 = 0(1) then 

n 



E(|Ar,|) = (l + o(l))- . 

We can argue as in the proof (33) that ii v,w G A^ are at distance at least uj/2 in G then 

Pr(£„ n S^) = (1 + o(l))Pr(£,,)Pr(£^). (41) 

where £v = {v E TZit)}. 

Then if X = |7^(t) n Afd\ then 

E{X{X - 1)) < E(X)2 + 0{{c\ogn)'^/^)E{X) 

which implies that Var(X) = o(E(X)^) and then the Chebyshev inequality implies that 
X ~ E(X) ~ c^+6\o whp. The vertices outside A^ only contribute o{N) whp and Theorem 
1 follows immediately from this. 
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Component structure of the vacant set induced by a 
random walk on a random graph. 

Colin Cooper* Alan Frieze^ 
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Abstract 

We consider random walks on several classes of graphs and explore the likely structure 
of the vacant set, i.e. the set of unvisited vertices. Let r(t) be the subgraph induced 
by the vacant set of the walk at step t. We show that for random graphs G„,^p (above 
the connectivity threshold) and for random regular graphs Gr, r > 3, the graph r(i) 
undergoes a phase transition in the sense of the well-known Erdos-Renyi phase transition. 
Thus for t < (1 — e)t* , there is a unique giant component, plus components of size 
O(logn), and for t > (1 + e)t* all components are of size O(logTi). For Gn,p and Gr we 
give the value of t*, and the size of T{t). For Gr, we also give the degree sequence of 
r(t), the size of the giant component (if any) of T(t) and the number of tree components 
of r(t) of a given size k = O(logn). We also show that for random digraphs D^.p above 
the strong connectivity threshold, there is a similar directed phase transition. Thus 
for t < (1 — e)i*, there is a unique strongly connected giant component, plus strongly 
connected components of size O(logn), and for t > (1 + e)t* all strongly connected 
components are of size 0{\ogn). 



1 Introduction 



The problem we consider can be described as follows. We have a finite graph G = (V, E), and 
a simple random walk Wu on G, starting at u & V. What is the likely component structure 
induced by the unvisited vertices of G at step t of the walk? 
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Initially all vertices V^ of G are unvisited or vacant. We regard unvisited vertices as colored 
red. When W„ visits a vertex, the vertex is re-colored blue. Let Wuit) denote the position of 
Wu at step t. Let B{t) = {>V„(0), >V„(1), . . . ,>V„(t)} be the set of blue vertices at the end 
of step t, and TZuit) = V \ Bu(t). Let r(t) be the subgraph of G induced by TZ{t). Initially 
r(0) is connected, unless u is a cut-vertex. As the walk continues, r(t) will shrink to the 
empty graph once every vertex has been visited. We wish to determine, as far as possible, the 
likely evolution of the component structure as t increases. In this paper we will consider three 
models of random graphs, with vertex set V = [n] where [n] = {1,2, ... ,n}. These are the 
random graph Gn,p in which each edge of Kn is included independently with probability p, 
the random digraph Dn^p in which each edge of Kn is included independently with probability 
p in each direction, and the random graph G^, r > 3, sampled uniformly at random from the 
set of all simple r-regular graphs. 

Because we consider random walks on random graphs, there are two sources of error in our 
proofs; (i) that we sample a graph G which does not have the properties we need, or (ii) 
that the random walk W does not behave in the way we require. The set of graphs Q' with 
properties we require have measure (1 — o(l)) of the graph space Q. Some of our proofs are 
for walks W on a fixed graph G from the subset Q'. In this case, walks W on G with the 
properties we require have measure (1 — o(l)) of W. In other proofs, we use the method of 
deferred decisions, and reveal only the parts of the graph traversed by the walk. In that way, 
the vacant set TZ{t) of the walk induces a random graph, whose properties we can analyze. 

Apart from 0{-),o{-),Q{-) as a function of n — ;■ oo, where n = |V^|, we use the following: We 
say An <^ Bn or Bn ^ An if An/ Bn — )■ as n — )■ oo, and An ~ Bn if lim^^oo An/ Bn = 1. The 
notation u{n) describes a function tending to infinity as n — )• oo. We use the expression with 
high probability, whp, to mean, with probability 1 — o(log~ n) for any positive constant K. 
The variable n is the size of the vertex set of the graph, and we measure both walk and graph 
probabilities in terms of this. Usually, it will be clear that we are discussing the graph, or the 
walk, but if we wish to stress this point we write whpc or whpw- In the case where we use 
deferred decisions, if |7^(t)| = A^, the whp statements are asymptotic in A^, and we assume 
N{n) — 7- oo. The statement of theorems in this section uses the annealed probability measure 
(graph and walk), i.e. whp relative to both graph sampling and walks. 

We recall the typical evolution of the random graph Gn,p as p increases from to L Initially 
it consists of isolated vertices. As we increase p or equivalently add random edges, we find 
that when p = c/n, the maximum component size is logarithmic for c < 1, followed by a 
phase transition around the critical value c = 1. When c > 1 the maximum component size is 
linear in n, and all other components have logarithmic size. Our aim in this paper is to show 
that whp r(t) undergoes a reversal of this. Thus r(0) is connected and T{t) starts to break 
up as t increases. There is a critical value t* such that if t < t* by a sufficient amount then 
r(t) consists of a unique giant component plus components of size O(logn). Once we have 
passed through the critical value, i.e. t > t* by a sufficient amount, then all components are 
of size 0(log?7,). As t increases, the maximum component size shrinks to zero. We make the 



following definitions. A graph with vertex set Vi is sub- critical if its maximum component 
size is O(logn), and super- critical if is has a unique component Ci(t) of size r2(|Vi(t)|), where 
|^i(^)| ^ log 77,, and all other components are of size O(logn). 

In the case of random digraphs D^p the evolution is more complex. Let q = 1 — (1 — p)^. There 
is a phase transition around q = 1/n (i.e. p ~ l/2n) for the emergence of a giant component 
in the underlying graph Gn,q of Dn.p, and, around p = 1/n (i.e. q ~ 2/n) for the emergence of 
a giant strongly connected component in Dn,,p- We make the following definitions. A digraph 
with vertex set Vi is directed- sub- critical if its maximum strongly connected component size 
is O(logn), and directed- super- critical ii is has a unique strongly connected component Ci{t) 
of size 0(|Vi(t)|), where |Vi(t)| ^ logn, and all other strongly connected components are of 
size O(logn). 

Component structure of vacant set of random graphs Gn,p and Dn,p- 
We first consider Gn,p- We assume that 

np = clogn where (c — 1) logn — > oo with n, and c = n^''^. 

where u = u{n) can be any function tending to infinity with n. 

Let 

te = ?7.(loglogn + {1 ~\- 9) logc). 

Theorem 1. Let e > be a small constant. Then whp in Gn,p we have (i) T{t) is super- 
critical for t < t_e, (ii) r(t) is sub- critical for t > t^. 

In Section 5, we prove that for t = t^, whp the size of 7^(t) is Nit^) = (H-o(l))n/(c-'^+^logn), 
and that T(t) behaves as the random graph GN{te),p- The threshold for the giant component 
in Gn,p is at Np = 1. For 6* < 0, let 6* = — e. Then N{t_^)p = c^ > 1, and there is a giant 
component Gi{t_^) of size VL{N{t_^)) = fi(n/(clogn)). The bound c = n^^^, ignores larger 
values of p. On the other hand there is not going to be a phase transition as p — )■ 1/ logn. 

We next consider -D„,p. Let q = 1 — (1 — p)^. Let T(t) be the directed graph induced by 
the vacant set of -D„ p and let r(t) be the undirected graph induced by the vacant set of the 
underlying graph Gn,q of -D„,p. Theorem 1 holds unaltered for r(t), with nq = clogn. For 
strongly connected components of T(t), we have the following theorem. 

Theorem 2. Let e > be a small constant. Let np = clogn. Then whp in Dn,p we have (i) 
V{t) is directed- super- critical for t < t_^, (ii) T(t) is directed- sub- critical for t > t^. 

Component structure of vacant set of random regular graphs. 

We next consider Gr for r > 3, constant. Let 

r = <r-l)\ogir-l)^_ ^^^ 



Let 






Nt-- 




where p - 


= (r- 


-l)/(r- 


- 2), and let 


(r-2)t 



(2) 



Pt = e f""" ■ (3) 

Theorem 3. Let e > be a small constant. Then whp we have 

(i) r(t) is super- critical for t < (1 — e)t* , and |Ci(t)| = Q{n). Let p = pt, then 

\Ci{t)\r^eNt 

where, 

and where is the smallest positive solution to 

(a) T(t) is sub-critical for t > (1 + e)t* , 
(Hi) At some time t r^ t* the maximum component size in T(t) is n'^/^+''^'^') . 

When r = 3, equations (4), (5) give (j) = {{1 ~ p)/pY and 6 = 1 — (e*/^^") — 1)^. 
We can also say something about |7^(t)| and the degree sequence of T{t). 

Let To = and for fc > 1 let 

i-i/fc 1 , prn log n 

Tk = n ' , and tk = --, ^. , • (6) 

k[r — 1) + r 

Theorem 4. Let e, 5 be small positive constants. Suppose that log log ra <^uj = uj{n) <^ logn. 

(a) whp simultaneously for all t < (1 — e)toj l"^!^)! = (1 + o(l))A^j. This result also holds 
whp for any fixed (1 — e)to "^ t < to — un. 

(b) Let Dgit) be the number of vertices of degree s in T{t). Then whp, simultaneously for all 
0<s<rand for all (r,,_,)i+'5 < t < {1 - e%, 

D,{t) = {l + o{l))N,Qpt{l-pty-'. 

This result also holds whp for any fixed (1 — e)ts < t < tg — uju. 



We make the following remarks. 

• It was proved in [7] that the cover time of Gr is C{Gr) ~ {{r — l)/(r — 2))n\ogn = to- 

• The whp concentration of e.g. |7^(t)| in Theorem 4 holds simultaneously for all t < 
(1 — e)to- The value of e can be made arbitrarily small. We have not attempted to 
optimize these results. It is also true for any given t <tQ — un, but not proven to hold 
for every t. 

• The range tg — un < t < ts + un contains the times when the number of vertices of 
degree s is constant in expectation and unlikely to be concentrated around its mean. 

We can give some more information about the number of small components in T{t). Again 
there is a gap containing the times when the expected number of such components is constant. 



Theorem 5. Let e be a small positive constant. Let 



r,ik,t) = ne-*/(^") — "- -(^''^ ^]%t\^ - 



k{{r-2)k + 2) \ k-l 



Pt 



,fc{r-2)+2 



(a) Let 1 < k < elogn and en < t < (1 — e)tfc_i. Let N{k,t) denote the number of tree 

components o/r(t) withk vertices. Then whp for any given t, N{k,t) = {l+o{l))ri{k,t). 

(b) For k constant, whp, simultaneously for all t in the range en < t < (1 — e)tk-i, the 

number of trees with k vertices is concentrated around the value ri{k,t). This result also 
holds whp for any given t in the range {Tk{.r -2)+2Y^^ < ^ < ^n, where 6 is a small 
positive constant. 

Again we make some remarks. 

• Most small components are trees, as e.g. whp Gr induces at most O(logn) cycles of 
constant size. 

• Theorem 5 holds simultaneously for k constant and any t. We do not claim that Theorem 
5 holds simultaneously for all k = O(logn) and t. Our proof only show this to be true 
for most values of t. 

• The intervals containing trees of size k constant are nested. Thus, whp isolated red 
vertices are the first trees to appear, (at around time n^~^^^), and the last to disappear, 
at the cover time (around to = pnlogn). 



Finally, we state without detailed proof the results on the vacant set arising from the use of 
k simultaneous random walks, k > 1 constant. The structure of rfc(t) is equivalent to r(t') 
where t' = [1 + o{l))kt. In particular tl = t* /k. The reasons for this assertion are based on 
Lemma 9 and Corollary 10. The probability that none of k independent random walks visit 
a vertex v during steps T, ...,t, is the k-th power of the probability that a single random walk 
does not visit v during steps T, ...,t. 



1.1 Previous work 

The only previous works on this subject that we are aware of are Benjamini and Sznitman [1], 
Windisch [21] and Cerny, Teixeira and Windisch [5]. Papers [1], [21] deal with the component 
structure of the vacant set of a random walk on a rf-dimensional torus. Paper [5] deals with 
random walks on Gr- It shows that whp r(t) is sub-critical for t > (1 + e)t* and that there 
is a unique linear size component for t < (1 — e)t*. They conjecture that r(t) is super-critical 
for t < (1 — e)t* and we prove this conjecture. 

Subsequent to our posting a preliminary version on the ArXiV, Cerny and Teixeira [6] have 
used the methods of this paper to give a sharper analysis of r(t) in the critical window. 



2 Uniformity 

The main idea is to realize that the graph r(t) has a simple distribution. First consider Gn,p- 

Lemma 6. 

Consider a random walk on Gn,p- Conditional on N = |7^(t)|, T{t) is distributed as Gjq^p. 

Proof This follows easily from the principle of deferred decisions. We do not have to 
decide the existence or absence of edges between vertices in 7l{t) until one of them is exposed. 

D 

Thus to prove Theorem 1 we only need high probability estimates of |7^(t)|. 

We next consider Gr- We give two structural definitions. 

Lemma 7. 

Consider a random walk on Gr- Let T{t) have vertex set TZ(t) and degree sequence d = 
{dr(t){v), V G Tlit))- Conditional on N = |7^(t)| and degree sequence d, T(t) is distributed as 
GN,d, the random graph with vertex set [N] and degree sequence d. 



Proof Intuitively, if we condition on TZ{t) and the history and if Gi, G2 are graphs with 
vertex set TZit) and if they have the same degree sequence then substituting G2 for Gi will 
not conflict with the history i.e. every extension of Gi is an extension of G2 and vice-versa. 
We now give a formal explanation of this. 

The history of the process can be represented by a sequence X = {Xq, . . . ,Xt^i) where 
X e [r]*. This sequence is to be interpreted as follows. We assume the neighbours ui, ...,Ur oi 
each vertex v are given in some fixed order, e.g. increasing label order, (recall that V = [n]). 
When at the j-th vertex v = WuiJ — 1), the walk moved to the Xj-th neighbour ux oi v, in 
the given order. The probability space for the lemma is uniform over Qr x [i^Y, where Qr is the 
set of r-regular graphs on [n]. Given u = {G,X), we let 7li^,Bi^,E^^,d^ denote the induced 
values of 7^(t), B{t), the edges of G that are incident with B{t) and the degree sequence of the 
graph r^ induced by TZ{t). These quantities are all determined by u. For consistency, we let 
duj{i) be the degree of the z'th vertex of r^^ in numerical order. We can in this way associate 
V^ with Gd„. Fix IZ and a degree sequence d. Let Qua be the set of graphs with vertex set 
IZ and degree sequence d. For a graph H G Qti^ we let VL{H) = {oj -.V^ = H}. Then 

P^iH I X) = H^. (7) 



We argue next that if Hi,H2 G 011,(1 then |r2(ifi)| = \Q{H2)\. The lemma follows from this and 
(7). We define a map = 0^1,//^ ': ^(^1) ^ ^(^2) by 0(G,X) = ((G \ E{Hi)) U E{H2),X) 
i.e. we remove the edges of Hi from G and replace them by the edges of H2. We only have to 
show that 0(G, X) G fl{H2) because then 4)Hi,H2 will have an inverse 4>h2,Hi- So let CU2 = 4>{^i) 
where oji G Q{Hi). We have B^^ = B^^ since we have not changed X and we have not changed 
the set of edges incident with B^^^ . This also means that TZi^^ = TZ^^ and E^-^ = E^^ and hence 
d^i = d^^j. By construction, F^^ = H2 and the lemma follows. □ 

Thus to prove Theorem 3 we only need high probability estimates of the degree sequence of 
r(t). The proof of Theorem 5 can in principle be derived from this, although we do not have 
a simple way of doing it. Instead we rely on a further characterization of F(t). 

We use the configuration or pairing model of Bollobas [3] and Bender and Canfield [4]. We 
start with n disjoint sets of points Wi, W2, ■ ■ ■ , Wn each of size r. We let W = ljr=i ^«- 
A configuration F is a partition of W into rn/2 pairs i.e. a pairing. Q is the set of 
configurations. If F G f2 defines an r-regular multi-graph Gf = {[n],EF) where Ep = 
{{i,j) '■ 3 {x, y} E F : X E Wi, y G Wj} i.e. we contract Wi to a vertex i for i G [n]. 

It is known that: (i) Each simple graph arises the same number of times as Gp- i-e. if 
G, G' are simple, then \{F : G p = G}\ = \{F' : G'p = G']\. (ii) Provided r is constant, the 
probability Gp is simple is bounded below by a constant. Thus if F is chosen uniformly at 
random from Vt then any event that occurs whp for F, occurs whp for Gp, and hence whp 
for Gr- 



Suppose now that we generate a random F using a random walk on [n]. To do this, we begin 
with a starting vertex ii, and at the start of t-th step we are at some vertex if, and have a 
partition Rt, Bt of W into Red and Blue points respectively. Initially, Ri = W and Bi = 0. 
In addition we have a collection Ft of disjoint pairs from W where Fi = 0. 

At step t + 1 we choose a random edge incident with it. Recall that the neighbours of it are 
in a fixed order indexed by 1, ...,r. Obviously it G B(t), as it is visited by the walk, but we 
treat the configuration points in Wi^ as blue or red, depending on whether the corresponding 
edge is previously traversed (blue) or not (red). Let x be chosen randomly from Wi^. There 

are two cases. 

If X e Rt, then the edge is unvisited, so we choose y randomly from Rt \ {x}. Suppose that 
y &Wj. This is equivalent to moving from it G B{t) to it+i = j. If j G B{t) this is equivalent 
to moving between blue vertices on a previously unvisited edge. If j G 7^(t), this is equivalent 
to moving to a previously unvisited vertex. We update as follows. Rt+i = Rt\ {x, y} and 
Bt+i = BtU {x, y}, and Ft+i = FtU {{x, y}}. 

If on the other hand, x ^ Bt then it has previously been paired with a y G Wj fl Bt and we 
move from it to it+i = j without updating. We let Rt+i = Rt, Bt+i = Bt and we let Ft+i = Ft. 

After t steps we will have constructed a random collection Ft of at most t disjoint pairs from 
W. Ft consists of a pairing of Bt, and Rt is unpaired. In principle we can extend Ft to a 
random configuration F by adding a random pairing of Rt to it. The next lemma summarizes 
this discussion. 

Lemma 8. 

(a) Ft plus a random pairing of Rt is a uniform random member ofQ. 

(b) ^G7^(t) iffW.CRt. 



3 Vacancy probabilities 



As in our previous papers on random walks on random graphs, we make heavy use of Lemma 
9 below. Let P be the transition matrix of the walk and let Pu (v) = Pr(>Vu(t) = v) be the 
t-step transition probability. We assume the random walk VV„ on G is ergodic, and thus the 
random walk has stationary distribution n, where 7r„ = d{v)/{2m). 

Suppose that the eigenvalues of P are 1 = Ai > A2 > ■ ■ ■ > A„. Let Amax = max{|Aj| : z > 2}. 
We can make A2 = Amax? if necessary, by making the chain lazy i.e. by not moving with 
probability 1/2 at each step. This has no significant effect on the analysis. Let $g be the 



conductance of G i.e. 

$G= min ^^^^ X K , J _ /gx 

SCV,TTs<l/2 71 g 

Then, 

1 - $G < A2 < 1 - ^ (9) 

\P!-Hx)-7r.\<{7r.M'^'XL.- (10) 

A proof of this can be found for example in Jerrum and Sinclair [16]. 

Mixing time of Gn,p, Gr- Let T be such that, for t > T 

max|pW(x)-..|=0f^^^^^^^^ (11) 

For G = Gn,p and np = clogn, c > 1, whp the conductance $(G) > constant, and so 

T{Gn,p) = 0{\ogn). (12) 



For G = Gr, Friedman [13] has shown that whp A2 < (2-y/r — 1 + e)/r < 29/30, say. In 
which case we can whp take 

r(G',.) < 1201ogri. (13) 

If inequality (11) holds, we say the distribution of the walk is in near stationarity. Fix two 
vertices u, v. Let ht = Pr (Wuit) = v) be the probability that the walk Wn visits v at step t. 

Let 

00 

H{z) = J2htz' (14) 

t=T 

generate ht for t > T. 

We next consider the returns to vertex v made by a walk Wv, starting at v. Let rj = 
Pr(VVt,(t) = v) be the probability that the walk returns to v at step t = 0, 1, .... In particular 
note that rg = 1, as the walk starts on v. Let 

00 

R{z) = J2nz' 

t=0 

generate r^, and let 

T-l 

RT{z) = '^rjZ^. (15) 



Thus, evaluating Rt{z) at z = 1, we have -Rt(I) ^^0 = 1. 

For t > T let ft = ft{u-^v) be the probabihty that the first visit made to v by the walk W^ 
to V in the period [T, T + 1, . . .] occurs at step t. Let 



Hz' 



t=T 

generate ft- Then we have 

H{z) = F{z)R{z). (16) 

The following lemma gives the probability that a walk, starting from near stationarity makes 
a first visit to vertex t; at a given step. For proofs of the lemma and its corollary, see [9]. 
The proof differs from the earlier version given in [7] , in that we only consider first visits to 
a vertex v after the mixing time T. We use the lemma to estimate Fj\TZT{t)\, the expected 
number of vertices unvisited after T, this differs from E|7^(t)| by at most T vertices. 

Lemma 9. Let i?„ = Rj-ll), where Rt{z) is from (15). For some sufficiently large constant 
K, let 

A = T^, (17) 



KT' 



where T satisfies (11). Suppose that 
(i) For some constant 6 > 0, we have 

(ii) TtTjj = o(l) and Tn^ = r2(r2~^). 
There exists 



min \Rt(z)\ > 9. 



^" i?.(l + 0(T7r.))' ^^^^ 

such that for all t>T, 

f\{u^v) = (1 + 0(Tvr.)) /; + 0(T7r„e-^*/^). (19) 

\^ ~T~ Pv) 

= (l + 0(T7r,)) f^^ for t> log' n. (20) 

(^i + p^j 

Corollary 10. Fort > T let Av{t) be the event thatWu does not visit v at steps T, T+1, . . . ,t. 
Then, under the assumptions of Lemma 9, 



Prw(A(t)) = ^ .7 .," + 0(r^7r,e"^*/^) (21) 

\l + Pv) 

(l + 0(T7r,)) 



for t > log^ n. (22) 



Jv 
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We use the notation Pr>v when we want to emphasize that we are dealing with the probability 
space of walks on G . 

Corollary 10 gives the probability of not visiting a single vertex in time [T,t]. We need to 
extend this result to certain small sets of vertices. In particular we need to consider sets 
consisting of v and a subset of its neighbours N{v). Let S be such a subset. 

Suppose now that 5* is a subset of V with l^l = o{n). By contracting S to single vertex 
7 = 7(5'), we form a graph H = H{S) in which the set S is replaced by 7 and the edges that 
were contained in S are contracted to loops. The probability of no visit to 5" in G can be 
found (up to a multiplicative error of 1 + 0{l/n'^)) from the probability of a first visit to 7 in 
H. This is the content of Lemma 11 below. 

We can estimate the mixing time of a random walk on H as follows. Note that the conductance 
of H is at least that of G. As some subsets of vertices of V have been removed by the 
contraction of 5", the set of values that we minimise over, to calculate the conductance of 
H, (see (8)), is a subset of the set of values that we minimise over for G. It follows that 
the conductance of H is bounded below by the conductance of G. The conductance of G is 
constant, (see discussion below (11)), and so using (9), (10), we see that the mixing time for 
W in if is 0{\ogn). 

Lemma 11. [9] Let Wu be a random walk in G starting at u ^ S, and let Xu be a random 
walk in H starting at u ^ ^. Let T be a mixing time satisfying (11) in both G and H . Then 



Pr(^^(t); H) = Pr(A,e5A.(t); G)[l + 0, ^^ 
where the probabilities are those derived from the walk in the given graph. 

Proof 

Note that m = rn/2 = \E{G)\ = \E{H)\. Let >V^(j) (resp. X^{j)) be the position of walk 
Wx (resp. Xx{j)) at step j. Let T = G,H and let P^ix; F) be the transition probability in F, 
for the walk to go from m to x in s steps. 

Pr{A,{t);H) = J2Pl(^-^H)Pr{Xx{s-T)^^,T<s<t;H) 

= E (^(1 + Oin-'))) Pr(X.(^ -T)^^,T<s<t;H) (23) 
= J2iPui^;G){l + 0{n-')))Pr{Wx{s-T)^S,T<s<t;G) (24) 
= Pr(A,e5A(t);G)(l + 0(1/^3)). 
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Equation (23) follows from (11). Equation (24) follows because there is a natural measure 
preserving map between walks in G that start at x ^ S" and avoid S and walks in H that 
start at X 7^ 7 and avoid 7. □ 



4 The evolution of T{t) in Gr 

4.1 Estimates of Ry 

Let 

£1 = ei log^ n, (25) 

for some sufficiently small ei. A cycle C is small ii \C\ < ii. A vertex is nice if it is at distance 
at least £1 from any small cycle. Let TV denote the nice vertices and AT denote the vertices 
that are not nice. 

It is straightforward to prove by first moment calculations that: 

Whp there are at most n^*^^ vertices that are not nice. (26) 

Whp there are no two small cycles within distance ii of each other. (27) 

The results we prove are all conditional on (26) and (27) holding. This can only infiate 
the probabilities of unlikely events by 1 + o(l). This includes events defined in terms of the 
configuration model as claimed in Lemma 8. For example, if a calculation shows that an 
event S has probability at most e in the configuration model, then it has probability 0(e) 
with respect to the corresponding subgraph of G and then we only need to multiply this by 
1 + 0(1) in order to estimate the probability conditional on (26) and (27). We will continue 
relying on this without further comment. 

A vertex v is tree-like to depth k if Nk{v) induces a tree, rooted at v. Here Nk{v) denotes the 
set of vertices at distance at most k from v, k > 1. Thus a nice vertex is tree-like to depth 

ii/2. 



Lemma 12. 

(a) If V is nice then 



r — 1 
Ry = {1 + o(l))p where p = 



•-2' 
where the o(l) term is o(log~ n) for any positive constant K . 

(b) // V is not nice then 

/?, <(l + o(l))^— 
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Proof (a) Let H denote the subgraph of G induced by Ni-^/2{v). This is a tree and we 

can embed it into an infinite r-regular tree T rooted at v. Let X be the walk on T, starting 
from V, and let Xt be the distance of X from the root vertex at step t. 

Let Dq = 0, and let Dt be the distance from i; of W in G at step t. We note that we can 
couple Wv, X so that Dt = Xt up until the first time that Dt > ^i/2. 

The values of Xt are as follows: Xq = 0, Xi = 1, and if X^ = then Xt^i = 1. If X^ > then 

X = f ^*-i ^ ^ ^^^'^ probability q = ^ ,^^. 

\ Xt-i + 1 with probability p = ^^. 

We note the following result (see e.g. [12]), for a random walk on the line = {0, ...,a} with 
absorbing states {0,a}, and transition probabilities q,p for moves left and right respectively. 
Starting at vertex z, the probability of absorption at the origin is 



p(.,a) = M^llfj^<f^y, (29) 

^ ' 1 - (q/p)- - \pj ' ^ ' 



provided q < p. 



Let Uoo = {3t > 1 : Xt = 0}, i.e. the event that the particle ever returns to the root vertex in 
T. It follows from (29) with z = 1 and a = oo that 

Pr(f/oo) = -^. (30) 

r — 1 

It follows that the expected number of visits by A" to f is ^ i = p. We write 

T oo 

Rv = '^rt and p = ^ p^ 

t=o t=o 

where pt = Pr{Xt = v). 

Now Vt = Pt for t < ii/2 and part (a) follows once we prove that 

T oo 

J2 r, = o(l)and ^ Pt = o(l). (31) 

t=£i/2+l t=£i/2+l 

The first equation of (31) follows from 

1 

^ -^max (32) 



1 

n — 

n 



where Amax is the second largest eigenvalue of the walk. This follows from (10). 
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The second equation of (31) is proved in Lemma 7 of [7] where it is shown that 

t=ei/2+l 2j=ei/2+l ^-^ ^ 2j=£i/2+l ^ ^ 

Thus 

i?.=p + 0(TA^/^ + T/n + (8/9^^/2) 

and part (a) follows, (b) We next note a property of random walks on undirected graphs 

which follows from results on electrical networks (see e.g. Doyle and Snell [11]). Let v be 

a given vertex in a graph G and 5* a set of vertices disjoint from v. Let p{G), the escape 

probability, be the probability that, starting at v, the walk reaches S before returning to v. 

For an unbiased random walk, 

1 

d{v)REFF' 

where Reff is the effective resistance between v and 5* in G. We assume each edge of G has 
resistance 1. In the notation of this paradigm, deleting an edge corresponds to increasing the 
resistance of that edge to infinity. Thus by Raleigh's Monotonicity Law, if edges are deleted 
from G to form a sub-graph G' then R'eff — Reff- So, if we do not delete any edges incident 
with V then p' < p. 

It follows from (27) that H becomes a tree after removing one edge. We can remove an edge 
not incident with v. By the above discussion on electrical resistance we see that this will not 
decrease Pr(?7^), where this is Uoo defined with respect to T* which is T less one edge, not 
incident with v. We can argue crudely that 



This is because there is an ^^ chance of a first move to a part of the tree that has branching 
factor r — 1 at every vertex. 

Let Ui = {X returns to the root vertex after starting at ^i/2}. Then, with Ft equal to the 
probability of a return by Wi, to v during [1, T], we have 

Ft < Pr(f/^) + TPr(t/*). (34) 

The RHS of (34) is at least the probability that W^, returns before reaching distance £i/2 or 
returns after reaching distance £i/2 at some time t < T. 



Now, using (29), we see that 



Pr(f^i) ^ (,_m/4 - (35) 



Here we have £i/4 in place of £i/2 to account for the one place where we move left with 
probability ^7^. We argue that at least one of the paths from t; to u; or u; to the boundary 
must be at least £i/4 and not use the vertex incident to the deleted edge. 
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Thus Ft<{2 + o(l))/r and since R^ < j^ we have R^ < ^^±^. D 



4.2 Proof of Theorem 4 

This section estabhshes the whp values of |7?.(t)| and Ds{t) for r(t). We also calculate the 
whp value of |W(t)|, where W(t) is the number of unvisited edges at step t. This gives us the 
value of \Rt\ = 2\U{t)\ (see Section 2), which we need for the proof of Theorem 5. 

Fix V E V and let N{v) = {wi,W2, ■ ■ ■ , Wr} and choose < s < r. Let 

P{v, s, t) = Prw({f , wi,..., Ws} C 7l{t) and {ws+i, • • • , Wr} C B{t)). 

Then 

Prvv;(t> G TZ{t) and has degree s in T{t)) = I )-P(f , s,t). (36) 

Lemma 9 is only valid after the mixing time T, and Corollary 10 gives precise results only 
after some t sufficiently larger than T. We assume henceforth that t > log n. We deal 
with the very beginning of the walk [t < T) in Section 4.5. Define the set TZrit) to be 
those vertices visited by the walk during steps T,...,t, and let BT(t) = ^ \ TZrii)- Thus 

7^(t) = 7^T(t) \ {>v,(o), >v,(i), . . . , >v„(T - 1)} 

We recall the definition of a nice vertex, as given at the start of Section 4.1. We will say an 
edge e = {m, u} is nice, if both m, v are nice. The next lemma gives enough information to 
compute the expected number of unvisited nice vertices, edges between nice vertices, and the 
expected degree sequence of nice vertices in TZ{t). 

Lemma 13. Let pt = e"^'^^"^^*)/'-'"'"^ as given by (3). Whp^j^, for all nice vertices v, and for 
all nice edges {u,v}, for t > log n, 

(a) Prw(^^ e 7^T(t)) = (1 + o(l))e-^. 



2t 

prn 



(b) Prwiiu, v} e Urit)) = (1 + o(l))e 

(c) Let 

Pt{v, s, t) = Pryv{{v, wi,..., Ws} C 7^r(t) and {ws+i, . . . , Wr} C Brit)), 

then 

PT{v,s,t) = il + o{l))e^^pt{l-pty-'. 



^We use the subscript G to emphasize that the probabiHty space is random r-regular graphs 



15 



Proof Part (a) follows directly from Corollary 10 using py from (18) and the value of Ry 
from 12(a). 

Part (b) follows similarly, by subdividing the edge e = {u, v} with an artificial vertex a to 
form the graph H. The stationary distribution of a in if is tt^ = 2/{rn + 2), and the value of 
Ra is given by Ry of Lemma 12(a). 

For part (c) we proceed as follows. Let X C N{v). Let 7x denote the contraction of {v} UX. 
Corollary 10 applies to •jx and 

(r-2)ir + {r-2)\X\) 
P^x = (1 + o{l)) r{r-l)n ' ^ ^ 

as we now explain. From (18), the expression for p^^ ~ ix^^jR^^. The stationary distribution 
of 7x is TT^^ = (|X| + l)/n. Since v is nice, the expected number of returns, is (up to a factor 
1 + o(l)), R^^ = j^] where / is the probability of return to the root jx of an infinite tree 
with branching factor r — 1 at each non-root vertex. Thus p.y^ = (1 + o(l))7r^^(l — /). It 
remains to calculate /. At the root there are |X| loops and r — |X| + (r — 1)|X| branching 
edges. This gives 

2\X\ {r-2)\X\+r 1 



r(|X| + l) r(|X| + l) r-1' 

and hence (37) above. 

Let N{v) = {wi, W2, . . . , Wr}, and let X = {wi, . . . , Ws} U Y where Y C {wg+i, ■ ■ ■ , Wr}. Then 
Pr>v({t;, wi,..., Ws} C TZrit) and {ws+i, . . . , Wr} C Brit)) 

= J2 (-l)"''Prw(({^, wi,..., Ws} UY)C IZrit)) 

YC{ws+i,...,t} 

where |X| = 0, ..., r — s. If s = 0, we suppose that {w, wi, . . . , Ws} = {v}. 
Thus, for s = 0, 1, ...,r, 
Prw({t^, wi,..., Ws} C nrit) and {ws+i, ■■■, Wr} C Brit)) = 

*- V y ^ YC[s+l,r] ^ \ J J 

(39) 
There are two cases. When t = 0{n) we can write (39) as 
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and (c) follows, since the terms above are ^2(1). When t/n — > oo we go back to (39) and 
observe that the sum is 1 — o(l) and thus 

Prvvd^', w^i, • • • , Ws] C TZrit) and {tf^+i, • • • , Wr} C BTit)) = 

( (r-2)2s + r(r-2) 

1 + o 1 exp -^ L ^ U 

I r[r — l)n 

as required. □ 

Proof of Theorem 4(a). 

Let t < to — un, where to = pnlogn and u satisfies the conditions of Theorem 4. 

Using (26), (27) we have whp, that \J7\ = ©(ra^'Mogn), and thus \J\f\ = n{l - o(l)). Let 
Z{t) = |7^(t) nAf\. As there are at most T = O(logn) vertices in 7^(t) \ 7^T(t), 

EZ{t) = r^(l-o(l))Prw(l^G7^^(t)) + 0(T) (40) 

= (l + o(l))ne~^. (41) 

Thus EZ(t) — ;■ cxD for t < pnlogn — un. 

We use the Chebyshev inequality to prove concentration of Z{t) for a single t < to — un. 

Suppose that log log n <t^ u' = uj'{n) <^ logn. We first show that 

Var(Z(t)) = 0(r"'EZ(t)) + e-'^'Y.{Z{t)f , (42) 

for some constant a > 0. 

Fix t and let E^ be the event that vertex v G TZ{t). We claim that if f , w are at distance at 
least u}' then 

Vv{£, n £^) = (1 + e-^("')) Pr(£,)Pr(£^). (43) 

To prove this we use Lemma 11. Let 5* = {f,i/;}, and let 7(5) be the contraction of S. For 
the random walk on the associated H we have 

-{R, + R^) <R^< -{R, + R^){1 + 0(Te-^("'))). 

2 2 

Indeed, the first move from 7 will be to a neighbour of v or w. Assume it is to a neighbour 
of V. The expected number of returns will be R^ plus R^ times the probability of a visit 
to w during the mixing time. Because v and w are at distance at least cu', using (32), the 
probability of a visit to w during T can be bounded by T{n~^ + X"^^^- Thus 



i?^ = p f 1 + e-^^'^')) , 7r^ = -, p^ = (l-0(Te-^("'))) — . (44) 

V / n on 
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Equation (43) follows on using Lemmas 9, 11 and equation (44). 
Thus 

E{Z\t)) = EZ{t)+ J2 Pr(^.n£^)+ Yl Pr(^.n£^) 

v,w v,w 

dist{v ,iv)'>u' dist{v ,11)) <u;' 

< EZ{t) + (1 + e-"^')E{Z{t)f + r'^'E(Z(t)) 
and (42) follows. Applying the Chebyshev inequality we see that 

Pr(|Z(t)-EZ(t)| >E(Z(t))e— /3) < ^^ + e"'^^'/'- (45) 

Provided t < to ~ con, EZ(t) > e^'^''^/2 and our choice of u' implies that the RHS of (45) is 
o(l) for such t. 

To prove the concentration of Z{t) simultaneously for all t < (1 — e)to, we proceed as follows. 
Let now u' = ii/2 where ii = ei log^n is given by (25). From (40)-(41), for t < (1 — e)pnlogn, 
we have EZ{t) = (1 + o(l))ne~*/'^" > n'-/2. We see that (45) becomes 

n^i^' paui' t/{pn) 

Pr(\Z(t) - EZ(t)\ > E(Z(t))e-'^" Z^) < ^— + e""" /^ = o(n-^), (46) 

n 

for some (small) S > constant. We can make S = (aeilogr)/4 provided we make e > 



^U' + I^j- 

We interpolate [0,to] at M = n^^'^ integer points Si,...,sm a distance a = to n~^l'^ apart 
(ignoring rounding). Let n{M) be the event that {\Z{t) -EZ{t)\ < E(Z(t))e-"'^'/3} holds 
simultaneously at Si, ..., sm- Then Pr(-i?^(M)) = o(n~'^/^) = o(l). 

The value of Z{t) is non-increasing, with EZ(t) = (1 + o(l))ne~*/''". Thus for any t, Si <t < 

■5i+l, 

Z{si) > Z{t) >Z(s,+i) 
(1 + o(l))e(*-^")/^" > 4^ > (1 - o(l))e(*""'+^)/''". 

But e*^*"'^')/^", e(*~'*'+i)/^" are both 1— o(l) and so the concentration result holds for nice vertices 
for t < (1 - e)to. 

Lemma 13 applies only to nice vertices. We next consider ATfi lZ{t). It follows from Lemmas 
9 and 12 that 

whp AT C BT{t) for t > lOeinlogn. (47) 
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However for t < lOeinlogn, 

EZ{t) > n^-^°'' > I AT] > IAT n 7^(t) I . (48) 

This completes the proof of part (a) of Theorem 4. 
Proof of Theorem 4(b). 
The proof of part (b) is similar to that of (a). Observe first that part (a) and Lemma 13 imply 

E{D'M = i^ + o{l))N,Qpt{l-p,r-\ (49) 

where -D^(t) is the number of nice vertices of degree s in r(t). 

Our next aim is to show 

E{D'^{t)) = e^("^) for (r,_,)^+^ < t < t, - wn (50) 

As a function of t, Ntpf{l — PtY^^ is log-concave and to bound E(D^(t)) from below we only 
need to check E(D^(t)) at the lower bound t = {rr^sY^^ and at the upper bound t = tg — un. 
See (6) for the definitions of Ts,ts. For t = o{n) we have pt = I — o(l) and Nt ^ n and 
then (50) is simple to verify at the lower bounds for t in (50). When t = ts — oon we have 
Pt = e p^ n sCj— 2)+'- and Nt = e~n «('— 2)+r- and (50) is also easy to verify. 

We can use the Chebyshev inequality to prove concentration. We let J^v{s) be the event that 
V is a vertex of degree s in T{t). We prove that for v,w & M , 

Pr(J-,(s) n J-^(s)) = (1 + e-^(-'))Pr(J-,(s))Pr(J-^(s)) (51) 

whenever v, w are at least u' apart, where u'/ log log n — t- oo and u = o(logn) and u' = o{uj).. 
We can argue for this by a small change in the argument for (43). This proves concentration 
for {Tr-sY^^ < i < ts+i—un. Let D^s(t) = Yl\=s-^kit)- This sum is monotone non- increasing 
with t. Simultaneous concentration of D>s[t) and hence Ds{t) follows from the interpolation 
method of part (a) applied to D>s(t). 

We must now argue that the contribution of AT is negligible. Equation (47) shows that whp 
AT C B(t) for t >Ti = lOeinlogn. On the other hand, by (49), for t = t* and any < s < r, 

, 10ei(r + s(r-2)) 

E{D',{t*)) = Q{n^ ^- ) > n^'K 

It follows from this and concentration of D'^(t) and (26) that D'^(t) ~ Ds{t) whp and the 
proof of part (b) is complete. 
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4.3 Proof of Theorem 3 

We combine Lemma 7 and Theorem 4 with the results of Molloy and Reed [19, 20]. We 
summarize what we need from these two papers: 

Theorem 14. Let Aq, Ai, . . . , A^ G [0, 1] be such that Aq + Ai + ■ ■ ■ + A^ = 1. Suppose that 
d = {di, d2, . . . , dN) satisfies \ {j : dj = s} \ = {1 + o{l))XsN for s = 0, 1, . . . , r. Let Gn,d be 
chosen randomly from graphs with vertex set [N] and degree sequence d. Let 

r 

L = ^s(s-2)A,. 



(a) If L < then whp Gn,d is sub-critical. 

(b) If L > then whp Gn,d is super- critical. Furthermore the unique giant component has 

size On where 6 is defined as follows: Let A = Yl\=i ^^s- Define a to be the smallest 
positive solution to 



A-2a-^sA, (1-=^) =0. (52) 

s=l 

Then 



2a 



s/2 



s=0 



2a 



s/2 



»=i-E^-(i-x) ■ («) 



We now evaluate L in the context of r(t). Then Theorem 4 implies that we can take 
^ = E n^t (1 - PtY-'sis - 2) = rptiir - l)pt - 1). 



s=0 



Thus the critical value for t is the one that gives L = and pt = :^. One can easily check 
that this is indeed the case for t* as defined in (1). Intuitively, this can be seen as follows. 
When we grow components outward from a given vertex, the branching factor is (r — l)pt, so 
when (r — l)pt < 1, all components are of finite size. 

Parts (i) and (ii) of Theorem 3 follow. Here is a quick check on the claimed size of Ci{t). First 
of all A = rpt > 0. We divide (52) by A and then let = 1 — ^ so that (52) becomes (5). 
With this value for we see that (53) becomes (4) and then whp |Ci(t)| ~ 9Nt as claimed. 

To prove (iii) we use the result of Hatami and Molloy [15] that when \L\ = 0{n~^^^) the size 
of the giant is n'^/^+°^^\ At each step, at most r vertices in T(t) change their degree and so L 
changes by 0{l/n). L starts out at r(r — 2) and so at some time it becomes equal to 0{n^^^^). 
This will happen at t ~ t* and the conditions of [15] will be satisfied. At this point whp there 
are B(n) vertices in r(t) and (iii) follows. □ 

20 



4.4 Proof of Theorem 5 

In this section we study the number of components of a given size. In principle one should 
be able to work this out from Lemma 7 and Theorem 4. This has proven more difficult than 
we anticipated. Instead, we try to estimate the number directly. We can use these lemmas 
though to argue that almost all small components are trees. Indeed if we fix t and condition 
on the values Ds = Ds{t) satisfying Theorem 4 then we have the following: 

Lemma 15. 

(a) If t < eiTilogn then whp there are at most n^^^ components of size k < eilogra that are 

not trees. 

(b) Ift> einlogn then whp there are no components of size k < log n that are not trees. 

Proof Let N = |7^(t)|. Applying Lemma 7 we see that the expected number of sets of k 
vertices that contain at least k edges is bounded by 



N^ fQ^ fr\k frNe' 



k \ k \nJ \ 2n 



k 



To prove (a) we take N = n and apply the Markov inequality. To prove (b) we take N = Nt- 

U 

With this in mind we concentrate on the number of tree components of size k for some 
k < ei log n. Since there are whp at most n^^^ vertices that are not nice, we will concentrate 
on counting the number of components that are made up of nice vertices only. We will also 
assume that t <to, see (50). 

The following is Lemma 4 of [2]. 

Lemma 16. [2] Let T be a labeled infinite r-regular tree. Let bk be the number of labeled 
subtrees of size k rooted at vertex v of T . Then 

[r — l)k 



2)k + 2\ k-l 



n 



Lemma 16 counts labeled sub-trees. Let T be such a tree. Each edge e = {x,y), with x 
closest to V, is associated with a label Ae indicating that it is the Ag-th edge incident with x, 
in numerical order. Now consider the situation described in Lemma 8. Fix v G TZ{t) fl A/". 
Assuming f G A/" introduces some conditioning on the allowable pairings described in Lemma 
8. However Pr(f G N") = 1 — 0{n'^'^^^^) (see (26)) and then we can use 

Pr(A)-Pr(^B) , , , , Pr(A) 
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for events A, B to correct (54) below for this conditioning. Consider the k neighbourhood of 
V in the multi-graph on [n] induced by a random pairing on R{t). It is a tree. Now delete 
any edge that corresponds to an edge (x, y) with x G 7l{t),y e B{t). Let T be the component 
that contains v. If T has k vertices then T corresponds to a tree component of r(t) with k 
vertices. 

So, fix a tree T* as counted in Lemma 16 and let us determine the probability that T = T*. 
The probability space for this calculation is as follows: Let Ft, Rt be as in Lemma 8. We have 
paired up the elements of Ft and we are now considering random pairings of Rt- To extend 
Ft to F, we can generate the pairing of Rt in any order we please. Thus we start by pairing 
elements of W^ the root of our tree. 

If we choose x G W^ then the probability it is paired with y G W^, for some z G 7^(i), is 

u = ^^^5^. Using Lemma (13)(a),(b), with |7^(t)| = {1 + o{l))Nt and \Rt\ = 2|W(t)| we 

obtain 

u = '''^^1^'"^ = (1 + o(l))e-('-2)*/(P™) = (1 + oil))pt, (54) 

\Rt\ — 1 

where pt is given by (3), and the o(l) term is o(log~ n) for any positive constant K. 

Suppose now that we have generated 0(log?T,) pairings. Then both |7^(t)| and \Rt\ change 
by O(logn) and, provided t < to{l — e), e > 0, since they are both of size Q{n). Choosing 
an unpaired x' G W^' for v' G TZit) we see that the probability of x being paired with some 
y' G Wz' where z' G TZit), is again (1 + o{l))pt. 

To estimate Pr(T = T*) we start at the root v and examine the points paired with W^, where 
Wjj C Rt, because v G 7l(t). The count in Lemma 16 assumes an ordering of the neighbours 
of each vertex and by implication an ordering of Wy and a statement about which members 
of Wv are paired with WfK^t) and which should not. Suppose we pair W^ with points from 
Wxi,i = 1,2, ... ,d. Then we continue by pairing up Wa,^ and then Wx2 and so on. The factor 
Pt~^ is from the k — 1 times we have to pair with W-jK^t) and the factor (1 — p^)(''~2)fc+2 jg ^Youi 
the number of times we do not. 

It follows that 

Pr(T = T*) = (1 + o{1))p1-\1 - PtY'-^^"^^. (55) 

It follows from (55) that 

EiN{k,t)) = (1 + o(l))iV,|p^i(l -p,)(^-^)'=+^. 

It remains to prove concentration around the mean. We use the Chebyshev inequality. We 
fix two vertex disjoint trees Ti,T2 in G. Arguing as above we see that 

2 

Pr(Ti,T2 are components of r(t)) < (1 + o(l)) 1 I Pr(Tj is a component of r(t)). 

1=1 
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Provided E(A^(A;,t)) — )• cxd and it does so for {Tk(r-2)+2y^^ ^ t < {1 — e)tfc we can use the 
Chebychev inequality to prove Theorem 5(a) for any fixed t. Note that here we have used 
concentration in the configuration model to imply concentration in the simple graph model. 

We now prove Theorem 5(b). For k constant, Fi{N{k,t)) = (l + o(l))?7(/c, t) > n^ , throughout 
the range en < t < {1 — e)tk-i. Concentration can be established by the methods used in 
Section 4.2. Let A be a large constant. For en < t < An we use the Chebychev inequality 
directly, and for An < t < (1 — e)tk-i we use the interpolation method. For en < t < An, 
N[k,t) > a^n where a^ is some constant. Using (45), for some 6 > 0, 



Pr{\N{k,t) -EN{k,t)\ >EN{k,t)o{n'^)) = o{n 



-S\ 



Interpolate en < t < An at h = n^'"^ points (ti, ..., t/^), i = (A — e)n^ ^'"^ apart. Let t G 
{tj,tj+i). Then \N{k,t) - N{k,tj)\ < ri and \T]ik,t) - vik,tj)\ = 0{i). It follows that, with 
probability 1 - o{n-^/'^), \N{k,t) - EN{k,t)\ < EN{k,t)o{n-^/^) for all en < t < An. For 
AN < t < {1 — e)tk_i we use the interpolation method of Section 4.2 as follows. Whp the 
maximum component size is O(logn) and there are O(logra) vertices on non-tree components. 
Let M{k, t) be the number of vertices on components of size at least k. Then M{k, t) is 
monotone non-increasing and 

O(logn) 

Mik,t) = Yl jNij,t)+Oi\ogn) 

j=k 

kN{k,t) = M{k-l,t)-M{k,t) + Oi\ogn). 
Applying the interpolation method to M{k,t) will complete the proof of Theorem 5(b). 



4.5 In the beginning 

We remind the reader that our proofs so far, we have assumed t > log n. We now consider 
the first few moves of the walk. Using Lemma 7 we see that for 1 < t < log^ n we have that 
r(t) is a random graph with a degree sequence of the following form: There are n — s vertices 
of degree r, where s < rt, and s vertices of degree < r. If the minimum degree in r(t) is at 
least one then whp we find that r(t) is connected. Indeed, let K- be the set of vertices of 
degree r in T{t). We argue that whp 

Vr induces a connected subgraph of T(t). (56) 

Each X G TZit) \ K is adjacent to K-. (57) 

For k even let 

0(A:) 



(A;/2)!2'=/2 
be the number of ways of partitioning [k] into k/2 pairs. 
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Let m = O(logn) be the sum of the degrees, in T(t), of the vertices in TZit) \ V^. Then 
Pr((56)/a^/.) < ^ f f "1 h) ^(^r + mju - s- k)r + m - I) 



k=3 1=0 

n/2 m / \ / \ ((r{n-s)+m)/2\ 

n\ m\ \ (kr+i)/2 ) 



k I \ I I (r{n—s)+m\ 
^ ^ ^ I kr+l ) 

n\ fm\ 1 



fc=3 1=0 

n/2 rri 

SEE 

fc=3 /=0 ^ ^ ^ ^ \ (kr+l)/2 

"■/2 ™ / \ / \ / 1 1 7 \ {kr+l)/2 

^-^ ^-^ / n \ / m \ / kr + l ^ 

fc=3 /=o 
= 0(1). 



k \ I (Hn-s)+m)/2\ 
'' ^ '' \ (kr+l)/2 J 



I J V T"in — s) + m 



Explanation of (58): Choose a set of k vertices S of degree r and / points from the m points 
in W associated with vertices T of degree less than r. (Some of the points associated with 
these latter vertices have already been paired). Now pair up the kr + / points randomly and 
the remaining points randomly. The I points contain the edges between 5* and T. 

The probability that (57) fails is 0(\ogn/n). A vertex of TZit) \ K of degree d has an 0{n~'^) 
chance of not being connected to K-. 

So we only have to deal with the possibility that there are isolated vertices in T{t) for t < log n. 
So consider the event 

A{t) = {3v G N{Wt) : V G 7^(t) and N{v) C B{t)} . 

We claim that 

Pr(^(^)) = o ((^r^^) ■ (59) 

It follows that 

log n 

Pr((57) fails) < ^ Pr(^(t)) = o(l). 
t=i 

To prove (59) fix t and a neighbour v oi Wt- Equation (27) implies that there is at least one 
neighbour w of f that is not contained in a small cycle. li w ^ Xt then to reach w the walk 
W must emulate a walk on the infinite tree T that starts at distance ii/2 from the root and 
visits it. This has probability l/(r — 1)^^'^ and this must be infiated by log n to account for 
log n possible starting times, li w = Xt then to visit another neighbour of v then we must 
first reach distance at least ii/2 and then we can repeat the argument and use inequality (35). 
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5 The evolution of r{t) in Gn,p and Dn^p 

We first consider Gn,p and prove Theorem 1. 

We first note some properties of tlie degree sequence dciv) of Gn,p- We assume that c = n^^^ 
where w — > oo. Let Ui = log^^'^n. For a fixed vertex v, its degree dciv) satisfies 

Pr{\dG{v) - clogn\ > uJi{clognY^^) < 2e"'^i/l 

This follows from Chernoff bounds on the tail of the binomial distribution. Denote by A^, 
vertices of Gn,p which have degrees in the range clogn ± a;i(clogn)^/^. By the Markov 
inequality, we see that whp |-i(A/'d)| = (ne""^!/^). 

Let |7^(t)| = A^. Because T(t) has the distribution Gj\[^p, we only need good estimates of 
N. We can get these using Lemma 9. Fix a vertex v G J\fd- It is shown in [9] that whp 
R^ = 1+0(1/ log n) for ally G V. Let e = l/(logn)i/6), then for t; e ^fd, n^ = (l+0(e))(l/n). 
Thusp, = (l + 0(e))(l/n). 

For t > log n, (22) implies that 

Pr(i7 G 7^(t)) = (1 + 0(logn/n))e-(i+^(^))*/". 

Recall that te = n(loglogr2 + (1 + 6*) logc), and assume that t = te, where 6 = 0(1) then 

E(|A^,n7^(t)|) = (1 + 0(1))^,:^. 

Regarding concentration, we can argue as in the proof (43) that ii v,w E Md are at distance 
at least 071/2 in G then 

Vv{£, n £^) = (1 + o(l))Pr(£,)Pr(£^). (60) 

where £^ = {v E lZ{t)}. 

Then if X = |7^(t) n Nd\ then 



E(X(X - 1)) < ^ Y, Pr(£, n£^)+Y, Yl P^(^- ^ ^-) 

veAfa weNd v&Afd w&Nd 

dist{v,w)>uJi/2 dist{v,w)<LUi/2 

<(1 + «(1))E E Pr(^.)Pr(^J + E E Pr(^^) 

veAfd w&Afd 

dist{v ,w)>LOi /2 

< (1 + o(l))E(X)2 + 0((c logn)"^/2)E(X) 



v^Md weAfd vGAfd weAfd 

dist{v,w)>L0i/2 dist{v,w)<uii/2 
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which imphes that Var(X) = o(E(X)^) and then the Chebyshev inequahty imphes that 
X - E(X) ~ ,,^/L_ whp. 

The vertices outside Md only contribute o{N) whp and thus 

N{te) = (1 + o(l))EiV(te) = (1 + o(l)K(c^+^logn). 

The threshold for the giant component in Gjsf^p is at Np = 1, i.e. as 6 ^ from below. 
Theorem 1 follows immediately from this. 

We next consider Dn,p and prove Theorem 2. As the details are similar to those above, 
our discussion will be brief. If np = clogn and (c — 1) logn — )■ oo then whp Dn,p is strongly 
connected, so a random walk on -D„,p is ergodic. It was established in [10] that if (c— 1) logn — )■ 
oo, then almost all vertices v have have stationary distribution 7r„ = (1 + o{l))/n. By the 
method of deferred decisions, r(t) is a random digraph DN(^t),p on N(t) = |7?.(t)| vertices, 
where N{t) is given as above. It was proved in [17] that the threshold for the emergence of a 
giant strongly connected component in Dn,p is at np ~ 1. Theorem 2 follows. 
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